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8.3.3 Confidence Intervals for Normal Samples 

In the above discussion, we assumed n to be large so that we could use the CLT. An interesting aspect of the confidence intervals that we 
obtained was that they often did not depend on the details of the distribution from which we obtained the random sample. That is, the 

confidence intervals only depended on statistics such as X and S 2 . What if n is not large? In this case, we cannot use the CLT, so we need to use 
the probability distribution from which the random sample is obtained. A very important case is when we have a sample X^, X 2 , X 2 ,..., X 
from a normal distribution. Here, we would like to discuss how to find interval estimators for the mean and the variance of a normal 
distribution. Before doing so, we need to introduce two probability distributions that are related to the normal distribution. These distributions 
are useful when finding interval estimators for the mean and the variance of a normal distribution. 

Chi-Squared Distribution 

Let us remember the gamma distribution. A continuous random variable X is said to have a gamma distribution with parameters a > 0 and 
X > 0, shown as X ~ Gamma(a, X), if its PDF is given by 


0 otherwise 



Now, we would like to define a closely related distribution, called the chi-squared distribution. We know that if Zj, Z 2 , ••*, Z n are independent 
standard normal random variables, then the random variable 

X = Z, + Z~ + ••• + Z 

1 2 n 

is also normal. More specifically, X ~ N( 0, n ). Now, if we define a random variable Y as 

Y = Z\ + z\ + - + Z 2 n , 

then Y is said to have a chi-squared distribution with n degrees of freedom shown by 

Y ~ x \n). 

n 1 

It can be shown that the random variable 7 has, in fact, a gamma distribution with parameters a = y and X = ^, 

Y ~ Gamma 



Figure 8.5 shows the PDF of/ 2 («) distribution for some values of n. 
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Figure 8.5 - The PDF of/ 2 (n) distribution for some values of n. 


So, let us summarize the definition and some properties of the chi-squared distribution. 


https://www.probabilitycourse.com/chapter8/8_3_3_confidenceJntervals_for_norm_samples.php 


2/11 





9/18/2018 


Confidence Intervals for Normal Samples 


The Chi-Squared Distribution 

Definition 8.1. If Z 2 , ••*, Z n are independent standard normal random variables, the random variable Y defined as 

Y=Z 2 l +Z 2 2 + -+Z 2 n 

is said to have a chi-squared distribution with n degrees of freedom shown by 

Y ~ x \n ). 

Properties: 

1. The chi-squared distribution is a special case of the gamma distribution. More specifically, 


Thus, 


Y ~ Gamma 


1 _i 

fy(y) = - y 2 V 2, for y > 0. 

22 r 




2. EY= n, Var(f) = 2n. 

2 

3. For any p E [0, 1] and n E N, we define x p n as the real value for which 

P( y >X 2 p ,„)=P, 

2 2 2 

where Y ~ / (w). Figure 8.6 shows % n . In MATLAB, to compute/^, n you can use the following command: chi2inv(l - p, n) 
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2 

Figure 8.6 - The definition ofx p ^ n - 

Now, why do we need the chi-squared distribution? One reason is the following theorem, which we will use in estimating the variance of 
normal random variables. 

Theorem 8.3. Let X±,X 2 , •••, X n be i.i.d. Nip., g) random variables. Also, let S 2 be the standard variance for this random sample. Then, the 
random variable Y defined as 


(n - 1)S 2 1 _ 

y =—— = -I(*i -*) 2 

G G i= i 


has a chi-squared distribution with n — 1 degrees of freedom, i.e., Y ~ / 2 ( n ~ !)• Moreover, X and S 2 are independent random variables. 


The ^-Distribution 

The next distribution that we need is the Student's t-distribution (or simply the t-distribution). Here, we provide the definition and some 
properties of the /-distribution. 

The t-Distribution 

Definition 8.2. Let Z ~ N( 0, 1), and Y ~ / 2 (w), where n E N. Also assume that Z and Y are independent. The random variable T defined as 


is said to have a /-distribution with n degrees of freedom shown by 

T ~ T{n). 

Properties: 

1. The /-distribution has a bell-shaped PDF centered at 0, but its PDF is more spread out than the normal PDF (Figure 8.7). 

2. ET = 0, for n > 0. But ET, is undefined for n = 1. 

3. Var(7) = ~~ 2 , for n > 2. But, Var(7) is undefined for n = 1,2. 

4. As n becomes large, the / density approaches the standard normal PDF. More formally, we can write 

d 

Tin) NiO, 1). 

5. For any p E [0, 1] and «6N, we define t p n as the real value for which 

P(T>t pn )=p. 

Since the /-distribution has a symmetric PDF, we have 


t\-p,n tp,n‘ 

In MATLAB, to compute t p n you can use the following command: tinv(l — p, n). 

Figure 8.7 shows the PDF of /-distribution for some values of n and compares them with the PDF of the standard normal distribution. As we 
see, the / density is more spread out than the standard normal PDF. Figure 8.8 shows t p n . 
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T ~ T(n) 



Why do we need the /-distribution? One reason is the following theorem which we will use in estimating the mean of normal random variables. 
Theorem 8.4. Let X^, X 2 , “',X n be i.i.d. N(p., a) random variables. Also, let S 2 be the standard variance for this random sample. Then, the 
random variable T defined as 


X~n 

S/^n 


has a /-distribution with n - \ degrees of freedom, i.e., T ~ T(n — 1). 
Proof: 

Define the random variable Z as 
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Z = 


X-fi 

<j/yn 


Then, Z ~ V(0, 1). Also, define the random variable Y as 

r 

<r 2 ' 

Then by Theorem Theorem 8.3 . Y ~ / 2 (n — 1). We conclude that the random variable 



has a ^-distribution with n— 1 degrees of freedom. 

Confidence Intervals for the Mean of Normal Random Variables 

Here, we assume that X^,X 2 ,X 2 ,... ,X n is a random sample from a normal distribution N(ju, a 2 ), and our goal is to find an interval estimator 
for //. We no longer require n to be large. Thus, n could be any positive integer. There are two possible scenarios depending on whether a 2 is 
known or not. If the value of a 2 is known, we can easily find a confidence interval for /i. This can be done using exactly the same method that 
we used to estimate n for a general distribution for the case of large n. More specifically, we know that the random variable 


X-u 

S = -r 
a/-\jn 

has N( 0, 1) distribution. In particular, Q is a function of the X-s and /i, and its distribution does not depend on fi. Thus, Q is a pivotal quantity, 


and we conclude that 


X— z-~p,X+ z- ~i= 
2 yn 2 


is (1 - a) 100% confidence interval for /i. 


Assumptions: A random sampleXj, X 2 , X 2 ,... ,X is given from a N(ju, cr 2 ) distribution, where Var^) = o 2 is known . 


Parameter to be Estimated: /a. = EX;. 


Confidence Interval: 


a a 

X— zf-7=X+z“-p 

2 2 


is a (1 — a) 100% confidence interval for ja. 


The more interesting case is when we do not know the variance a 2 . More specifically, we are given Xj, X 2 , X 2 ,..., X n , which is a random 

sample from a normal distribution N(ja, a 2 ), and our goal is to find an interval estimator for /i. However, o 2 is also unknown. In this case, using 
Theorem 8,4 . we conclude that the random variable T defined as 


X-ju 
T SI '\Jl7 

has a ^-distribution with- 1 degrees of freedom, i.e., T ~ T(n — 1). Here, the random variable T is a pivotal quantity, since it is a function of 
the X-s and //, and its distribution does not depend on // or any other unknown parameters. Now that we have a pivot, the next step is to find a 
(1 - a) interval for T. Using the definition of t p n , a (1 - a) interval for T can be stated as 


P\~t a - 


< T < t‘l 


= 1 - a. 


Therefore, 
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X-u 

P| —t “ _. < - — <tt i I = 1 — «, 

2 ’" S/a/m 2 ’" I 


which is equivalent to 




We conclude that 


s s 

X — t“ i —p.X+t'L . — p 

2 ,« 1 2 ,n * 


is (1 — a) 100% confidence interval for fi. 


Assumptions: A random sample X^,X 2 , X 2 ,... ,X is given from a N(ju, a 2 ) distribution, where fi = EX. and Var(X.) = o 2 are unknown . 


Parameter to be Estimated: fi = EX t . 


Confidence Interval: 





is a (1 — a) confidence interval for //. 


Example 8.20 

A farmer weighs 10 randomly chosen watermelons from his farm and he obtains the following values (in lbs): 

7.72 9.58 12.38 7.77 11.27 8.80 11.10 7.80 10.17 6.00 

Assuming that the weight is normally distributed with mean n and and variance <7, find a 95% confidence interval for /u. 
• Solution 

o Using the data we obtain 


X= 9.26, 
S 2 = 3.96 


Here, n = 10, a = 0.05, so we need 


^0.025,9 ~ 2.262 

The above value can be obtained in MATLAB using the command tinv(0.975, 9). Thus, we can obtain a 95% confidence interval 
for fi as 


X t J,n-l.r> X+t t,n-l r 
2 yn 2 a/m 


9.26 - 2.26 ■ 


a/I96 

VTo 


, 9.26 + 2.26 ■ 


a/To 


= [7.84, 10.68], 


Therefore, [7.84, 10.68] is a 95% confidence interval for fi. 


Confidence Intervals for the Variance of Normal Random Variables 


Now, suppose that we would like to estimate the variance of a normal distribution. More specifically, assume thatX l ,X 2 , X 2 ,..., X n is a 

random sample from a normal distribution N(fi, a 2 ), and our goal is to find an interval estimator for o 2 . We assume that fi is also unknown. 
Again, n could be any positive integer. By Theorem 8.3 . the random variable Y defined as 


Q = 


(n - 1)S Z 1 




- 2 /Ti 
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has a chi-squared distribution with n - 1 degrees of freedom, i.e., Q ~ / 2 (n — 1). In particular, Q is a pivotal quantity since it is a function of 
o . . . . o . 2 

the X-s and a , and its distribution does not depend on o or any other unknown parameters. Using the definition of/^ n , a (1 — a) interval for 
Q can be stated as 


Therefore, 


which is equivalent to 


P\x] « <Q<X 2 * , 1=1-a. 

1 -2 ,n~l 2 ,n_ / 


9 (w- 1 )S Z 9 

« < -2- ~ X “ , I = 1 ~ a - 

1 — j , 1 O 2 ,n ^ l 


(n-l)S 2 , (n - 1 )5 2 


2 - w - 2 


1 - a. 


We conclude that 


2 » 2 


is a (1 — a) 100% confidence interval for a. 


Assumptions: A random sample X i ,X 2 , X 2 ,... ,X is given from a N(ju, cr 2 ) distribution, where n = EX i and Var(X) = a 2 are unknown . 


Parameter to be Estimated: Var(X) = o l 


Confidence Interval: 


(n-l)5 2 («-l)5 2 



is a (1 - a) 100% confidence interval for o 2 . 


Example 8.21 

For the data given in Example 8.20 . find a 95% confidence interval for o 2 . Again, assume that the weight is normally distributed with mean 
and and variance a, where fi and o are unknown. 

• Solution 

o As before, using the data we obtain 


X= 9.26, 

S 2 = 3.96 

Here, n = 10, a = 0.05, so we need 

*0.025,9 = 19.02, X 0.975,'9 = ^-70 

The above values can obtained in MATLAB using the commands chi2inv(0.975, 9) and chi2inv(0.025, 9), respectively. Thus, we 
can obtain a 95% confidence interval for a 2 as 


(h-1)S 2 (n - 1)S 2 


2 ’ 2 
, / « . 
-,o-l 1 - -i , n - 1 


9 x 3.96 9 x 3.96 


19.02 ’ 2.70 


[1.87, 13.20], 


https://www.probabilitycourse.com/chapter8/8_3_3_confidence_intervals_for_norm_samples.php 


10/11 



9/18/2018 


Confidence Intervals for Normal Samples 


Therefore, [ 1.87, 13.20] is a 95% confidence interval forrr 2 . 


«— previous 
next —» 
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